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The Kochen-Specker theorem demonstrates that it is not possible to reproduce the predictions of 
quantum theory in terms of a hidden variable model where the hidden variables assign a value to 
every projector deterministically and noncontextually. A noncontextual value-assignment to a pro¬ 
jector is one that does not depend on which other projectors—the context—are measured together 
with it. Using a generalization of the notion of noncontextuality that applies to both measurements 
and preparations, we propose a scheme for deriving inequalities that test whether a given set of ex¬ 
perimental statistics is consistent with a noncontextual model. Unlike previous inequalities inspired 
by the Kochen-Specker theorem, we do not assume that the value-assignments are deterministic and 
therefore in the face of a violation of our inequality, the possibility of salvaging noncontextuality 
by abandoning determinism is no longer an option. Our approach is operational in the sense that 
it does not presume quantum theory: a violation of our inequality implies the impossibility of a 
noncontextual model for any operational theory that can account for the experimental observations, 
including any successor to quantum theory. 
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Although measurements in quantum theory cannot, 
in general, be implemented simultaneously, one can still 
ask whether the outcomes of such incompatible measure¬ 
ments might be simultaneously well-defined within some 
deeper theory. To formalize this deeper theory we use 
the framework of ontological models ^ which general¬ 
izes the notion of a hidden variable model. Contrary 
to naive impressions, it is possible to find models of 
this sort that reproduce quantum predictions. Problems 
only arise if one makes additional assumptions about the 
model. The Kochen-Specker theorem [2] famously de¬ 
rives a contradiction from an assumption we term KS- 
noncontextuality. Consider a set of quantum measure¬ 
ments, each represented by an orthonormal basis, such 
that some rays are common to more than one basis. It is 
assumed that every ontic state —a complete specification 
of the properties of the system, including values of hid¬ 
den variables—assigns a definite value to each ray, 0 or 
1, regardless of the basis (i.e. context) in which the ray 
appears. If a ray is assigned the value 1 (0) by an ontic 
state A, the measurement outcome associated with that 
ray is predicted to occur with probability 1 (0) when any 
measurement including the ray is implemented on the 
system in ontic state A. It follows that for every basis, 
precisely one ray must be assigned the value 1 and the 
others the value 0. 

The assumption that the ontic state assigns a deter¬ 
ministic outcome to each measurement is the greatest 
shortcoming of the Kochen-Specker theorem. Recall that 
determinism is not an assumption of Bell’s theorem Hi]. 
This is evident from derivations of the Clauser-Horne- 
Shimony-Holt inequality [5]. Even in Bell’s original 1964 
article [3], where deterministic assignments play an im¬ 


portant role, determinism is not assumed but rather de¬ 
rived from local causality and the fact that quantum the¬ 
ory predicts perfect correlations if the same observable 
is measured on the two parts of a maximally entangled 
state (an argument from Einstein, Podolsky and Rosen 
that Bell simply recycled [7] ). It was shown in Ref. [5] 
that one can make a similar argument about determin¬ 
ism in noncontextual models: rather than assuming it, 
one can derive it from a generalized notion of noncon¬ 
textuality and from two facts about quantum theory: (i) 
the outcome of a measurement of some observable is per¬ 
fectly predictable whenever the preceding preparation is 
of an eigenstate of that observable, and (ii) the indis- 
tinguishability, relative to all quantum measurements, of 
different convex decompositions of the completely mixed 
state into pure states. 

Hence, in any proof of the Kochen-Specker theorem 
one can replace the assumption of determinism with the 
generalized notion of noncontextuality and the quantum 
prediction of perfect predictability. If perfect predictabil¬ 
ity is indeed observed, then in the face of the result¬ 
ing contradiction, one must give up on noncontextuality. 
This contrasts with earlier proofs where one could al¬ 
ways salvage the generalized notion of noncontextuality 
by abandoning determinism. 

Of course, no real experiment ever yields perfect pre¬ 
dictability, so this manner of ruling out noncontextuality 
is not robust to experimental error. Following ideas in¬ 
troduced in recent work [5] , we show how to contend with 
the lack of perfect predictability of measurements and de¬ 
rive an experimentally-robust noncontextuality inequal¬ 
ity for any uncolourability proof of the Kochen-Specker 
theorem. 
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FIG. 1. Each of the 18 rays is depicted by a node, and 
the 9 orthonormal bases are depicted by 9 edges, each a loop 
encircling 4 nodes. There is no noncontextual assignment of 
Os and Is to these nodes such that for every edge precisely one 
node receives the value 1. For instance, we have depicted a 
noncontextual assignment of Os and Is to 17 of the rays, which 
cannot be completed to an assignment to all 18 rays because 
neither value (0 or 1) can be assigned to the remaining ray 
(marked by X): while one basis in which it appears requires 
it to take the value 0, the other requires the value 1. 


Review of the Kochen-Specker theorem. The 

original proof of the KS theorem required 117 rays in 
a 3d Hilbert space [2]. We use the much simpler proof 
in Ref. [TU] as our illustrative example. It involves a 
4d Hilbert space and 18 rays that appear in 9 orthonor¬ 
mal bases, each ray appearing in two bases. One can 
visualize this as a hypergraph with nodes represent¬ 
ing the rays and edges representing orthonormal bases 
(Fig. [^a)). There is no 0-1 assignment to these rays 
that respects KS-noncontextuality: the hypergraph is un- 
colourable (Fig. [^b)). Of course, if the value assigned to 
a ray were allowed to be 0 in one basis and 1 in the other 
(a KS-contextual value assignment) then one could evade 
the contradiction. 

Is it possible to test the possibility of a KS- 
noncontextual ontological model experimentally? One 
view is that the Kochen-Specker theorem is not amenable 
to an experimental test. It merely constrains the possi¬ 
bilities for interpreting the quantum formalism [nma. 
However, this answer is clearly inadequate. One can and 
should ask: what is the minimal set of operational predic¬ 
tions of quantum theory that need to be experimentally 
verified in order to show that it does not admit of a non¬ 
contextual model? 

We show that this minimal set is a far cry from the 
whole of quantum theory and is therefore consistent with 
many other possible operational theories. As such, the 
no-go result we derive shows that none of these theories 
admit of a noncontextual model. Furthermore, if this 
set of predictions is corroborated by experiment, then 
this implies that any future theory of physics that might 
replace quantum theory also fails to admit of a noncon¬ 
textual model. 

We begin with some definitions. An operational the¬ 
ory is a triple {V,M,p) where T" is a set of prepara¬ 
tions, A1 is a set of measurements, and p specifies, for 
every pair of preparation and measurement, the proba¬ 


bility distribution over outcomes for that measurement if 
it is implemented on that preparation. Specifically, if we 
denote the set of outcomes of measurement M by K-m, 
then VP S V, VAf G A4, p is a function of the form 
p(,|P,A/) : [0,1]. 

An ontological model of an operational theory 
{V,M,p) is a triple (A,where A denotes a space 
of possible ontic states for the physical system (here pre¬ 
sumed to be discrete), where p, specifies a probability 
distribution over the ontic states for every preparation 
procedure, that is, VP G V, /r(-|P) : A —[0,1], such 
that J2\eAl^iMP) = 1) 9,nd where ^ specifies, for ev¬ 
ery measurement, the conditional probability of obtain¬ 
ing a given outcome if the system is in a particular ontic 
state, that is, VM G M, ^{k\M, •) : A —>• [0,1], such that 
J2keKM ~ 1- order for the ontological model 

to reproduce the statistical predictions of the operational 
theory, it must be the case that 

p(fc|P,M) = ^C(fc|M,A)Ai(A|P) (1) 

AgA 

for all P G P, and M G M.. 

We denote the event of obtaining outcome k of mea¬ 
surement M by [/cjMj. If \k\M] is assigned a deter¬ 
ministic outcome by every ontic state in the ontological 
model, i.e., if ^{k\M, •) : A —>• {0,1}, then it is said to be 
outcome-deterministic in that model, and if this holds for 
all k, then M is also said to be outcome-deterministic. 

We explain how to derive an experimental test of non- 
contextuality using a sequence of four refinements on the 
standard account of the KS theorem: 

Operationalizing the notion of KS- 
noncontextuality. In a KS-noncontextual model 
of operational quantum theory, the value (0 or I) 
assigned to the event [k\M] by A is the same as the 
value assigned to the event [/c'jM'j whenever these two 
events are represented by the same ray of Hilbert space 
(here, we are assuming that M and M' are maximal 
projective measurements). We get to the crux of the 
notion of KS-noncontextuality, therefore, by describing 
the operational grounds for associating the same ray 
to [k\M] as is associated to [fc'jM'j. Letting \lk\M and 
Ilfe'iM' represent the corresponding rank-I projectors, 
the grounds for concluding that Iik\M = ^k'\M' are 
that = tr(pnfc/|jv/') for an appropriate set 

of density operators p. It is clearly sufficient for the 
equality to hold for the set of all density operators, but 
it is also sufficient to have equality for certain smaller 
sets of density operators, namely, those complete for 
measurement tomography, or simply tomographically 
complete. 

What then should the operational grounds be for as¬ 
signing the same value to [k\M] and [fc'jM'j in a gen¬ 
eral operational theory, where preparations are not rep¬ 
resented by density operators? The answer, clearly, is 
that the event [fc|M] occurs with the same probability as 
the event [fc'jM'j for all preparation procedures of the 
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system, 


p{k\M, P) = p{k'\M\ P) for all P S IP, (2) 

or equivalently, if this holds for a subset of V that is to- 
mographically complete. In this case, we shall say that 
[k\M] and are operationally equivalent, and de¬ 
note this as [k\M] ~ We can therefore define a 

notion of KS-noncontextuality for any operational the¬ 
ory as follows: an ontological model (A, of an oper¬ 
ational theory {V,M.,p) is KS-noncontextual if (i) oper¬ 
ational equivalence of events implies equivalent represen¬ 
tations in the model, i.e., [k\M] ~ => ^{k\M, A) = 

^{k'\M',X) for all A G A, and (ii) the model is outcome- 
deterministic, ^{k\M, •) : A —>• {0,1}. 

The operational equivalences among the measurements 
that are relevant for the 18 ray proof of the KS theorem 
depicted in Fig.j^a) are made explicit in Fig.j^a), where 
every measurement event [fc|M] is represented by a dis¬ 
tinct node, and a novel type of edge between nodes spec¬ 
ifies when two events are operationally equivalent. This 
representation affords a nice way of depicting contextual 
value assignments, such as in Fig. [^b). It follows that 
any operational theory that admits of nine four-outcome 
measurements that satisfy the operational equivalence 
relations depicted in Fig. [^a) fails to admit of a KS- 
noncontextual model. 



FIG. 2. (a) Nine four-outcome measurements. A blue loop 

encircling a set of nodes implies that these nodes denote out¬ 
comes of a single measurement. A yellow hashed region en¬ 
closing a set of nodes implies that the corresponding events 
are operationally equivalent, (b) A depiction of the fact that 
there is no outcome-deterministic noncontextual assignment 
of values in {0,1} to the measurements. The depicted value- 
assignment breaks the assumption of noncontextuality for the 
pair of highlighted nodes 


that the ontic state merely assigns a probability distri¬ 
bution over outcomes, and take this distribution to be 
the thing independent of the context. In Ref. [5], this 
revised notion of noncontextuality was termed measure¬ 
ment noncontextuality: 

Measurement noncontextuality is satisfied by 
an ontological model (A,/r,^) of an opera¬ 
tional theory {P,M.,p) if [fc|M] ~ im¬ 

plies £,{k\M, A) = ^{k’\M', A) for all A G A. 

Here, ^{k\M, •) G [0,1] (and not merely {0,1}). Outcome 
determinism is not assumed. 

Justifying outcome determinism for perfectly 
predictable measurements. Outcome determinism 
can, however, be justified sometimes if one assumes a 
notion of noncontextuality for preparations [5]. First, a 
definition: P and P' are said to be operationally equiv¬ 
alent, denoted P ~ P', if for every measurement event 
P assigns the same probability to this event as P' 
does, that is, 

p{k\M, P) = p{k\M, P') for all k&K-M, for all M G At. 

(3) 

A preparation-noncontextual ontological model is then 
defined as follows: 

Preparation noncontextuality is satisfied by 
an ontological model (A,/r,^) of an opera¬ 
tional theory {V,M.,p) if P ~ P' implies 
^(A|P) = /r(A|P') for all A G A. 

Insofar as both measurement and preparation noncontex¬ 
tuality are instances of operational equivalence implying 
ontological equivalence, it is most natural to assume both, 
that is, to assume universal noncontextuality. 

It was shown in Ref. [8] that in a preparation- 
noncontextual model of quantum theory, all pro¬ 
jective measurements must be represented outcome- 
deterministically. Here, we provide a version of this ar¬ 
gument for the 18 ray construction. 

Suppose that one has experimentally identified thirty- 
six preparation procedures organized into nine ensembles 
of four each, {Pi^k ■ i G {1,... ,9}, fc G {!,... ,4}}, such 
that for all i, measurement Mi on preparation Pi^k yields 
the fcth outcome with certainty, 

Vi,Vfc : p(A;|Mj,Pj,fc) = 1. (4) 


Defining a notion of noncontextuality with¬ 
out outcome determinism. The essence of noncon- 
textuality is that context-independence at the opera¬ 
tional level should imply context-independence at the 
ontological level. The operationalized version of KS- 
noncontextuality commits one to more than this, how¬ 
ever, because it makes an additional assumption about 
what sort of thing should be independent of context at 
the ontological level, namely, a deterministic assignment 
of an outcome. However, one can equally well assume 


We call this property perfect correlation. In quantum 
theory, it suffices to let Pi^k be the preparation associated 
with the pure state corresponding to the fcth element of 
the ith measurement basis. 

Define the effective preparation as the procedure 

obtained by sampling fc uniformly at random and then 
implementing Pi^k- We now suppose that one has exper¬ 
imentally verified the operational equivalence relations 

J>(ave) _ ^(ave) foj. alH, G {1, . . . , 9}. (5) 
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Contending with the lack of perfect predictabil¬ 
ity in real experiments. In real experiments, the 
ideal of perfect correlation described by Eq. 0 is never 
achieved, so we cannot derive a contradiction from it. 
However, Eq. ([^ is logically equivalent to the following 
inference: 

universal noncontextuality -I- operational equivalences 
—> failure of perfect correlation. (10) 

This means that the amount of correlation, averaged over 
all i and fc, will necessarily be bounded away from 1. It 
is this bound that is the operational noncontextuality 
inequality. For the 18 ray example, we prove that 


FIG. 3. 36 preparation procedures organized into nine ensem¬ 
bles of four each. A node at the end of a set of lines emanat¬ 
ing from the elements of an ensemble represents the effective 
preparation procedure achieved by sampling uniformly from 
the ensemble. A yellow region encircling a set of nodes implies 
that these preparations are operationally equivalent. 


These equivalences are depicted in Fig. They hold 
in our quantum example because the simply cor¬ 

respond to different ways of preparing the completely 
mixed state. 

Given Eq. ([^ and the assumption of preparation non¬ 
contextuality, there is a single distribution over A, de¬ 
noted v{X), such that 

^(;^| J^(ave)) ^ alH G {1, . . . , 9}. (6) 

Given the definition of , it follows that 

foralH G {1,...,9}. (7) 

k 

Furthermore, recalling Eq. Q, for the ontological model 
to reproduce Eq. 0 , we must have 

Vz,V/c:^e(fc|M„A)MA|P*.fc) = l. (8) 

A 

Because every A in the support of ;/(A) appears in the sup¬ 
port of fi{X\Pi^k) for some k, it follows that if ^{k\Mi,X) 
had an indeterministic response on any such A, we would 
have a contradiction with Eq. ®. Gonsequently, for all i 
and k, the measurement event must be outcome- 

deterministic for all A in the support of v{X). 

To summarize then, if one has experimentally verified 
the operational equivalences depicted in Figs. da) and 
and the measurement statistics described in Eq. Q , 
then universal noncontextuality implies that the value 
assignments to measurement events should be determin¬ 
istic and noncontextual, hence KS-noncontextual, and we 
obtain a contradiction in the usual manner. The argu¬ 
ment can be summarized thus 

universal noncontextuality -I- operational equivalences 
-I- perfect correlation —)■ contradiction. (9) 


9 4 

p{k\Mi,Pi^k) < |- 


( 11 ) 


i—1 k—1 


To test the assumption of noncontextuality, therefore, 
one must measure the correlation p{k\Mi, Pi^k) for all 
i and k, but one must also verify that the operational 
equivalences depicted in Figs, [^a) and hold, because 
only in this case does the assumption of noncontextuality 
imply that the inequality © should hold. 

We now outline how the bound in Eq. (11) is obtained. 
First, we use Eq. 0 to express A in terms of A) 

and ^(AjPi^fc). Defining the max-predictability of a mea¬ 
surement M given an ontic state A by 


C(M,A) = max ^(fc'|M,A), 

k' 


( 12 ) 


we deduce that 






= E(^9Ec(^-^)j 


(13) 


where we have used Eq. 0. 

The measurements can have indeterministic responses, 
^{k\M,-) : A —> [0,1], but measurement noncontextual¬ 
ity implies that ^{k\Mi,X) = ^(fc'|Mi/,A) for the oper¬ 
ationally equivalent pairs {[fc|Mi], [fc'jMi/]}. There are 
many such assignments. Every unit-trace positive opera¬ 
tor, for instance, specifies an indeterministic noncontex¬ 
tual assignment via the Born rule, and there are other, 
nonquantum assignments as well, such as the one de¬ 
picted in Fig.[^ Consider the average max-predictability 
achieved by the assignment of Fig Here, six mea¬ 
surements have max-predictability 1, while three have 
max-predictability 1. This implies that 1 ({Mi, A) = 

^(6 • 1 -b 3 • |) = |. As we demonstrate in Appendix 
[A] no ontic state has a higher average max-predictability 
than that of Fig. 0 so that maxA Ez A)) < |, 
thereby establishing the noncontextual bound on A. The 
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FIG. 4. Example of a noncontextual outcome-indeterministic 
assignment to the measurements. 


logical limit for the value of A is 1, so the noncontextual 
bound of I is nontrivial. The quantum realization of the 
18 ray construction achieves A = 1. 

Note that if an experiment fails to suppress noise suf¬ 
ficiently, then it may not succeed in violating our non- 
contextuality inequality. This simple criterion of opera¬ 
tional meaningfulness fails for previous attempts at deriv¬ 
ing noncontextuality inequalities m, a point we discuss 
further in Appendices and Although we have used 
the 18 ray uncolourable set of Ref. m as an example, the 


scheme described can be used to turn any proof of the 
Kochen-Specker theorem based on an uncolourable set 
into an experimental inequality. An issue we haven’t ad¬ 
dressed is that in practice no two measurement events are 
assigned exactly the same probability by each of a tomo- 
graphically complete set of preparations, nor do any two 
preparations assign exactly the same probability distribu¬ 
tion over outcomes to each of a tomographically complete 
set of measurements. The solution to this problem is de¬ 
scribed in related work BM- A question that remains 
is: how does one accumulate evidence that a given set of 
measurements or preparations is indeed tomographically 
complete? This question represents the new frontier in 
the project of devising strict experimental tests of the 
assumption of noncontextuality. 
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Appendix A: Proof of the inequality 

We can summarize our main result—a derivation of 
a noncontextuality inequality from the proof of the 
Kochen-Specker theorem for the 18 ray uncolourable set 
of Fig. [T] —by the following theorem: 

Theorem. Consider an operational theory (V,A4^p). 
Let {Mi G A4 : i G {1,...,9}} be nine four-outcome 
measurements. Let [fcjMi] denote the kth outcome of 
the ith measurement, where k G {1,...,4}. Let {Pi^k G 
V : i G {1,...,9}, k G {1,2, 3,4}} be thirty-six prepa¬ 
ration procedures, organized into nine sets of four. Let 
piave) g .p preparation procedure obtained by sam¬ 

pling k G {1,2,3, 4} uniformly at random and implement¬ 
ing Pi^k- 

Suppose that one has experimentally verified the op¬ 
erational preparation equivalences depicted in Fig. 
namely, 


logical model as 


f{k\M,,\)yi{\\P,^k). 


(A4) 


i—\ k—1 A 


Using the definition of the max-probability f{Mi,X), 
given in Eq. (12), we have 


(iU 


(AS) 


i=l \ 


k=l 


Assuming that one experimentally verifies the opera¬ 
tional preparation equivalences of Eq. the assump¬ 

tion of preparation noncontextuality implies that 

p(A|p(^™)) = = ■■■ = (A6) 

It follows that there exists a single distribution, which we 
denote i^(A), such that 


^(A|p(^™^) = iy{X) for alH S (1,..., 9}. 


(A7) 


Recall that is the preparation procedure that sam¬ 

ples k uniformly from {1,2,3,4} and implements Pi^k- 
Given that the probability of the system being in a given 
ontic state A given the preparation Pi^k is /r(A|Pi_fc), and 
given that the probability of Pi^k being implemented is j 
for each value of A:, it follows that the probability of the 
system being in a given ontic state A given the prepara¬ 
tion is /x(A|F’^®'™^) = Combining 

this with Eq. (A7), we conclude that 

i^/r(A|Pj,fc) = j^(A) for alH e {1,... ,9}, (A8) 


and therefore that 


p(ave) ^ p(ave) 
-"I — ^2 


p(ave) 
^9 5 


(Al) 


and the operational equivalences depicted in Fig. 
namely, 


A 2^1 

This in turn implies 


(A9) 


[k\M,] ~ [k'\M,,], (A2) 

for the eighteen pairs specifed therein. 

If one assumes that the operational theory admits 
of a universally noncontextual ontological model, that 
is, one which is both measurement-noncontextual and 
preparation-noncontextual, then the following inequality 
on operational probabilities holds 


A< max-^C(A7^,A). 


(AlO) 


Z=1 


Assuming that one experimentally verifies the oper¬ 
ational measurement equivalences of Eq. (A2), the as¬ 
sumption of measurement noncontextuality implies that 

akm,x) = ak'm',x), ( ah ) 


^ = (A3) 

2—1 k—l 

We now provide the proof. Eor clarity, we expand on 
some of the steps presented in the main article. 

Using Eq. 0 , the quantity A can be expressed in terms 
of the distributions and response functions of the onto- 


for the eighteen pairs of operationally equivalent mea¬ 
surement events {[k\Mi], [k'\Mi']) specifed in Fig. [^a). 

It is useful to simplify the notation at this stage. We 
introduce the variable n G {1,..., 18} to range over the 
eighteen operational equivalence classes of measurement 
events. We introduce the shorthand notation 

Wk = ^{k\Mi,X) = f,{k'\Mi>,X), 


(A12) 





or, more explicitly, as 


7 



5i:C(M..A) 

= + iiieix{w4,W5,WQ,W7} 

9 

+ max{ui7, ws, W9,t«io} + niaxlwio, wn, W 12 , wis} 

+ max{uii3, wii, wi5, wiq] + maxjwie, wn, wis, wi} 

+ max{uii8, ■IC2, wg, wii} + maxlica, UI5, W12, wu} 

+ max{ui6, W8, Wi5,i(;i7}]. (A17) 


FIG. 5. A choice of labelling of the eighteen equivalence 
classes of measurement events. Here, Wk denotes the prob¬ 
ability assigned to the equivalence class labelled by k in a 
noncontextual outcome-indeterministic ontological model. 


for the probability assigned to the /tth equivalence class, 
where the dependence on A is left implicit. The variable 
K enumerates the equivalence classes in Fig.j^a) starting 
from [l|Mi] and proceeding clockwise around the hyper¬ 
graph, as depicted in Fig.[^ 

In this notation, the constraint that each response 
function is probability-valued, ^{k\Mi.X) € [0,1], is sim¬ 
ply 


0 < < 1, V/cG 18}, (A13) 


while the constraint that the set of response functions for 
each measurement sum to 1, 
be captured by the matrix equality 

Zw = u (A.14) 


where 

w 


{wi, 


,Wis 


u 


(1, 

b 1,1,1 

1,1, 

1 , 1 ; 


and 
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(A15) 

Finally, we can express 

the quantity to be maximized 

as 


= i^^max^w^, (A16) 


The matrix equality of Eq. (A14| implies that 
there are only nine independent variables in the set 
{wi, ri; 2 , • ■ • j W 18 } and that these satisfy linear inequal¬ 
ities. The space of possibilities for the vector w therefore 
forms a nine-dimensional polytope in the hypercube de¬ 
scribed by Eq. (A13). 

The value of | on any of the interior 

points of this polytope will be an average of its values at 
the vertices because it is a convex function of w. There¬ 
fore, to implement the maximization over A, it suffices to 
maximize over the vertices of this polytope. 

Following a brute-force enumeration of all the ver¬ 
tices of the polytope, the maximum possible value 
of I X[i=i is found to be |. An ex¬ 

ample of a vertex achieving this value is w = 
( 1 , 0 , 0 , 0 , 1 , 0 , 0 , 0 , 5 , 0 , 0 , 0 , 1 , 0 , 0 , 0 )"^, which is de¬ 

picted in Fig. 1^ This concludes the proof. 

Our proof technique can be adapted to derive a similar 
noncontextuality inequality correponding to any proof of 
the KS theorem based on the uncolourability of a set 
of rays of Hilbert space. One begins by completing ev¬ 
ery set of orthogonal rays into a basis of the Hilbert 
space, and then forming the hypergraph depicting the 
orthogonality relations among these rays (the analogue 
of Fig. [^. One then forms the hypergraph decipting all 
of the measurements events, with one type of edge denot¬ 
ing which events correspond to the outcomes of a single 
measurement, and the other type of edge denoting when 
a set of measurement events are operationally equivalent 
(the analogue of Fig. [^a)). One then associates a set 
of preparations with every measurement in the hyper¬ 
graph, one preparation for every outcome. For each such 
set of preparations, we define the effective preparation 
that is the uniform mixture of the set’s elements, and we 
presume that all of the effective preparations so defined 
are operationally equivalent (as is the case in quantum 
theory, where the effective preparation for every set cor¬ 
responds to the completely mixed state). We consider 
the correlation between the measurement outcome and 
the choice of preparation in the set associated with that 
measurement, averaged over all measurements. This av¬ 
erage correlation is the quantity A that appears on the 
left-hand side of the operational inequality. 

The uncolourability of the hypergraph means that 
there are no noncontextual deterministic assignments to 
the measurement events, hence the polytope of proba- 








bilistic assignments to the measurement events has no 
deterministic vertices either. Each vertex of this poly¬ 
tope, that is, each convexly-extremal probabilistic assign¬ 
ment, will necessarily yield an indeterministic assignment 
to some of the measurement events. Using the opera¬ 
tional equivalences and the assumption of universal non- 
contextuality, one can infer from this that the average 
correlation A is always bounded away from 1. For any 
uncolourable hypergraph, a quantum realization would 
achieve the logical limit A = 1 hy construction, so the 
noncontextuality inequality we derive is necessarily vio¬ 
lated by quantum theory in each case. 

One can understand this violation as being due to the 
fact that assignments of density operators that are in¬ 
dependent of the preparation context can achieve higher 
predictability for the respective measurements than as¬ 
signments of probability distributions over ontic states 
that are independent of the preparation context. This is 
the feature of quantum theory that allows it to maximally 
violate the noncontextual bound of A < 5/6. 

Appendix B: Robustness of the noncontextuality 
inequality to noise 

How much noise can one add to the measurements and 
preparations while still violating our noncontextuality in¬ 
equality? We answer this question here assuming that 
the experimental operations are well-modelled by quan¬ 
tum theory. According to quantum theory, 

p{k\Mi,Pi^k) =T^T:{Ek\MiPi,k), (Bl) 

where denotes the positive operator representing 

the measurement event and pi^k denotes the den¬ 

sity operator representing the preparation Pi^k- To be 
precise, for every t, the set {E^Mi}k is a positive op¬ 
erator valued measure, so that 0 < E^^j. < /, and 
E^nj. = I, and for every i and k, pi^k is positive, 
Pi,k P 0, and has unit trace, Tipi^k = 1- 

In quantum theory, a noiseless and maximally infor¬ 
mative measurement is represented by a POVM whose 
elements are rank-1 projectors, that is, 

Ek\Mi = (B2) 

where for each k, H^ is a projector, hence idempotent, 
^Xk = Bi.fc, and is rank 1, so that lii^k = 
where for each i, the set {|V’i,/c)}fc is an orthonormal basis 
of the Hilbert space. If we furthermore set 

Pi^k — (B3) 

then we find p{k\Mi, Pi^k) = ET^{Ek\MiPi,k) = 1 for each 
{i,k), and consequently A = 1. We see, therefore, that 
the maximum possible value of A is attained when mea¬ 
surements satisfy the noiseless ideal. We can now con¬ 
sider the consequence of adding noise. 

We begin by considering a very simple noise model 
wherein the preparations and measurements both deviate 


from the noiseless ideal by the action of a depolarizing 
channel, that is, a channel of the form 

(B4) 

which with probability p implements the identity chan¬ 
nel and with probability 1 — p generates the completely 
mixed state. If the quantum states are the image of the 
ideal states under a depolarizing channel with parame¬ 
ter pi, and the POVM is obtained by acting the depo¬ 
larizing channel with parameter p 2 followed by the ideal 
projector-valued measure (such that the POVM elements 
are the images of the projectors under the adjoint of the 
channel), then 

Pi,k = PpiCHi^fc) = -I- (1 (B5) 

Ek\Mi = E>1^ (n*,fc) = P2n*,fc + {i-p2)^i, (B6) 

Here, the POVM {Ek\Mi}k is a mixture of {Ai^k}k and 
a POVM {i/, ij, f/, i/} which simply samples k uni¬ 
formly at random regardless of the input state. It fol¬ 
lows that for each {i,k), if we consider p{k\Mi, Pi^k) = 
Tr(i?j,|M.pi fc), we Hnd perfect predictability for the term 
having weight piP 2 while for the three other terms, we 
have a uniformly random outcome, so that in all 

p{k\Mi,Pi^k) =PiP2 + (1 -piP2)^- (B7) 

It follows that 

9 4 

^ (B8) 

i—1 k—1 

Thus a violation of the noncontextuality inequality, i.e. 
A > |, occurs if and only if 

7 

PlP2 > g- (B9) 

It turns out that one can derive similar bounds for 
more general noise models as well. Suppose that instead 
of a depolarizing channel, we have one of the form 

A/,,p(-) = pl{-)l + (I - p)p Tr(.). (BIO) 

With probability p, this implements the identity channel 
and with probability 1—p it reprepares a state p that need 
not be the completely mixed state, but which is indepen¬ 
dent of the input to the channel. The analogous sort of 
noise acting on the measurement corresponds to acting 
on the POVM elements by the adjoint of this channel, 
that is, 

A/';.p(-) = pH-)I + (1 - P)I Tr(p •)■ (Bll) 

Therefore, if this sort of noise is applied to the ideal 
states and measurements, with the parameters in each 
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noise model allowed to depend on i, we obtain 

Pr,k=Nu, + (1 

Pi jPz 

Ek\Mi = (Ili.fc) = P 2 ^Ili,k + (1 - P 2 ^)sik\i)I, 

P 2 

(B13) 


where s{k\i) = Tr(p*^*^ni_fc) is a probability distribution 
over k for each value of i. Here, the POVM {Ek\Mi}k 
is a mixture of {Ili^k}k and a POVM {s{k\i)I}k which 
simply samples k at random from the distribution s{k\i), 
regardless of the quantum state. Compared to the sim¬ 
ple model considered above, the innovation of this one is 
that for both preparations and measurements, the noise 
is allowed to be biased. 


For the case of Pi'^ = 0, which by Eq. ( |B12[ ) implies 
that pi^k = P^''\ we find that, regardless of the measure¬ 
ment, p(k\Mi, Pi^k) is just a normalized probability dis¬ 
tribution over k (because there is no k dependence in the 
state). Hence, in this case, j = i- 

Similarly, for the case of P 2 = 0, that is, when 
the POVM corresponds to a random number generator 
Ek\Mi = s{k\i)I, we find that, regardless of the prepa¬ 
ration, p{k\Mi, Pi^k) is again just a normalized proba¬ 
bility distribution over k. Hence, in this case again, 

lY.t=iPi>^\Mi,Pi,k) = I- 

It follows that for generic values of and P 2 *^ we 


have lJ2k=iPiMMi,Pi,k) = Pi P 2 + i^-Pi P 2 )\- In 


all then, we have 


Appendix C: Comparison to other noncontextuality 
inequalities 

We have proposed a technique for deriving noncontex¬ 
tuality inequalities from proofs of the Kochen-Specker 
theorem. It is useful to compare our approach with one 
that has previously been proposed by Cabello [l3]. We 
do so by explicitly comparing the two proposals in the 
case of the 18 ray construction of Ref. m- Indeed, the 
fact that Ref. m proposes an inequality for this con¬ 
struction is part of our motivation for choosing it as our 
illustrative example. 

For each of the eighteen operational equivalence classes 
of measurement events, labelled by k € {1,..., 18} as de¬ 
picted in Fig.[^ we associate a {—1, -l-l}-valued variable, 
denoted G { —1, +1}- A given ontic state A is assumed 
to assign a value to each The fact that there is only 
a single variable associated to each equivalence class im¬ 
plies that any assignment of such values is necessarily 
noncontextual. 

Ref. [13j considers a particular linear combination of 
expectation values of products of these variables: 

a = - {S 1 S 2 S 3 S 4 ) - {S 4 S 5 SeSr) - {SrSsSgSio) 

- {E10S11S12S13) — {S13S14S15S1Q) — (SieSnSisSi) 

- {SisS 2 SqSii) - {S3S5S12S14) 

- (SeSsSisSn), (Cl) 

and derives the following inequality for it: 

a<7 (C2) 


p{k\M„P,^k) 


1 

4 


3 

4 




2=1 


Consequently, a violation of the noncontextuality in¬ 
equality, i.e., A > |, occurs if and only if the noise pa¬ 
rameters satisfy 


(B15) 

Because the parameters p^^ and P 2 *^ decrease as one in¬ 
creases the amount of noise, this inequality specifies an 
upper bound on the amount of noise that can be tolerated 
if one seeks to violate the noncontextuality inequality. 

This analysis highlights how the approach to deriving 
noncontextuality inequalities described in this article has 
no trouble accommodating noisy POVMs. This contrasts 
with previous proposals for experimental tests based on 
the traditional notion of noncontextuality, which can only 
be applied to projective measurements. This is one way 
to see how previous proposals are not applicable to re¬ 
alistic experiments, where every measurement has some 
noise and consequently is necessarily not represented pro- 
jectively. 


(Note that Ref. [Oj used a labelling convention for the 
eighteen measurement events that is different from the 
one we use here; to translate between the two conven¬ 
tions, it suffices to compare Fig. 1 in that article with 
Fig. [^in ours.) Each term in a refers to a quadruple of 
variables that can be measured together, that is, which 
can be computed from the outcome of a single measure¬ 
ment. Different terms correspond to measurements that 
are incompatible. 

In Ref. [^, the following justification is given for the 
inequality (C2|. We are asked to consider the 2^® possi¬ 
ble assignments to (S*!,..., Sig) that result from the two 
possible assignments to S'„, namely —1 or -|-1, for each 
K S {1,..., 18}. It is then noted that among all such pos¬ 
sibilities, the maximum value of a that can be achieved 
is 7. 


Ref. [13] states that a violation of this inequality should 
be considered evidence of a failure of noncontextuality. 
We disagree with this conclusion, and the rest of this 
section seeks to explain why. 


1. The most natural interpretation 


It is useful to recast the inequality of Eq. (C21 in terms 


of variables with values in {0,1} rather than {—1, +1}. 
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Specifically, we take 


^. + 1 


(C3) 


Under this translation, products of the correspond to 
sums (modulo 2) of the For instance, an equation 
such as SkiSk ,2 = — 1 corresponds to the equation 0 
Vk 2 = Ij where 0 denotes sum modulo 2, while Sk,-^Sk 2 = 
0l corresponds to 0 Vk 2 = 0, so that 0 = 

^ jjj particular, we also have 


Vki 0 ® Vk3 0 Vfti 




0 1 


2 


(C4) 


or equivalently. 


S KI S 1^2 ^ 1^3 S 0 Vi^2 0 '^1^3 0 ^^^ 4 ) 1 ) (®^) 


We can therefore consider a quantity a', defined as 


a' ={vi 0 U2 0 U3 0 ^'4) 0 {vi 0 U5 0 ue 0 V7) 

0 {V 7 0 Ug 0 Wg 0 Uio) 0 (Uio 0 Vn 0 V12 0 V13) 

0 ('I'lS 0 '^14 0 '1^15 0 'I’le) 0 0 '^’17 0 "^^IS 0 'I’l) 

0 (Ui8 0 U2 0 Ug 0 Wii) 0 (t>3 0 U5 0 U12 0 U14) 

0 (u6 0 ug 0 wi 5 0 W 17 ), (C6) 


SO that a = 2a' — 9, and we can re-express inequality 
del as 


a' < 


(C7) 


Of course, rather than using Eq. (C5) to translate (C2) 


from { —1, 0l}-valued variables into {0, l}-valued vari¬ 
ables, one can also just derive the inequality (C7) di¬ 
rectly: among the 2^® possible assignments of values in 
{0,1} to each of the v^, the maximum value of a' is 8. 
Two examples of such assignments are provided in Fig. 



# : value 0 
O : value 1 


• : value 0 
0 : value 1 


FIG. 6. Examples of noncontextual assignments of {0,1}- 
values to the measurement events in Fig. [^a) where it is not 
required that every measurement has precisely one outcome 
that is assigned value 1 and three outcomes that are assigned 
the value 0. Example (a) depicts an assignment wherein there 
is a measurement all of whose outcomes receive probability 0. 
Example (b) depicts one wherein there is a measurement two 
of whose outcomes recieve probability 1. 


It is useful to use a notation that speches whether a 
given expectation value of some variable X is relative to 


a preparation procedure P, in which case it is denoted 
{X) p, or relative to an ontic state A, in which case it is 
de noted {X)x- We denote by a'{P) the quantity defined 
in (C61 if the expectation values contained therein are 


relative to preparation P, and we denote by a'(A) the 
case where the expectation values are relative to ontic 
state A. Under the assumption of an ontological model, 
each expectation value relative to a preparation P can be 
expressed as a function of the expectation value relative 
to an ontic state A, via 


(X)p = ^(X);,/r(A|P), 


(C8) 


where /i(A|P) is the distribution over ontic states asso¬ 
ciated with preparation P. We can infer from Eq. (C8) 
that 


a'(P)=^a'(A)/i(A|P). 


(C9) 


With these notational conventions, we can summarize 
the argument of Ref. m as follows. In any noncontex¬ 
tual ontological model, every ontic state A satisfies 


q;'(A) < 8. 


(CIO) 


But this in turn implies, through Eq. (C9), that for all 
preparations P, 


a'{P) < 8, 


(Cll) 


which is an inequality constraining operational quanti¬ 
ties. 

We are now in a position to describe the problem with 
the inequality (Cll I, or equivalently inequality (|C2|), and 


thus with the claim of Ref. m- First, we highlight the 
physical interpretation of the variables v^- If is as¬ 
signed value 1 by the ontic state A, then this means that 
if the system is in the ontic state A, and a measurement 
that includes n as an outcome is implemented on it, then 
the outcome k is certain to occur, while if v^, is assigned 
value 0 by A, then the outcome k is certain not to occur. 
But each of the 2^® different assignments to (ui,..., Uig) 
is such that for at least one measurement either: none 
of the outcomes occur, as in the example of Fig. |^a), 
or more than one outcome occurs, as in the example of 
Fig. §h). (This is precisely what is implied by the fact 
that the 18 measurement events are uncolourable, as ex¬ 
plained in the main text.) Such assignments involve a 
logical contradiction given that the four outcomes of each 
measurement are mutually excusive and jointly exhaus¬ 
tive possibilities. 

It follows that the sort of model that a violation of in¬ 


equality (Cll) rules out can already be ruled out by logic 


alone', no experiment is required. To put it another way, 
discovering that quantum theory and nature violate in¬ 
equality (Cll) only allows one to conclude that neither 


quantum theory nor nature involve a logical contradic¬ 
tion, which one presumably already knew prior to noting 
the violation. 
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We have argued in the main text that the notion 
of KS-noncontextuality, insofar as it assumes outcome- 
determinism, is not suitable for devising experimentally 
robust inequalities given that every real measurement in¬ 
volves some noise. The problem with inequality (Cll) 
can also be traced back to the use of the assumption of 
KS-noncontextuality. Suppose we ask the following ques¬ 
tion: given the existence of nine four-outcome measure¬ 
ments satisfying the operational equivalences of Fig.j^a), 
how are the operational probabilities that are assigned 
to these measurement events constrained if we presume 
that KS-noncontextual assignments underlie the opera¬ 
tional statistics? On the face of it, the question seems 
well-posed. On further reflection, however, one sees that 
it is not. There are simply no KS-noncontextual assign¬ 
ments to these measurement events, so it is simply im¬ 
possible to imagine that such assignments could underlie 
the operational statistics. There is nothing to be tested 
experimentally, as the hypothesis under consideration is 
seen to be false as a matter of logic. 

Here is another way to see that the inequality (Cll) 
does not provide a test of noncontextuality. Consider the 
expectation value 0 0 0 Vk,^)p for a prepara¬ 

tion P, where ki, K 2 , ks and K 4 correspond to the four 
outcomes of some measurement. Regardless of which of 
the four outcomes of the measurement occurs in a given 
run where preparation P is implemented—i.e. regardless 
of whether comes out as ( 1 , 0 , 0 , 0 ) or 

( 0 , 1 , 0 , 0 ) or ( 0 , 0 , 1 , 0 ) or ( 0 , 0 , 0 , 1 ) in that run—the vari¬ 
able Uki 0 Uk 2 0 Uk 3 ® ^Ki has the value 1. We can think 
of it this way: the variable v^i 0u„2 0 Wk 3 0 i’k 4 is a trivial 
variable because it is a constant function of the measure¬ 
ment outcome. (This is analogous to how, in quantum 
theory, for a four-outcome measurement associated with 
four projectors, although each projector is a nontrivial 
observable, their sum is the identity operator, which has 
expectation value 1 for all quantum states, and therefore 
corresponds to a trivial observable.) It follows that re¬ 
gardless of what distribution over the four outcomes is as¬ 
signed by P, the expectation value {v^i 0 'Ck 2 ®^^k 3 
will be 1. Given that each of the nine terms in a' {P) is 
of this form, it follows that a'{P) = 9. 

So, for any operational theory that admits of nine four- 
outcome measurements with the operational equivalence 
relations depicted in Fig.j^a), we will find that a' {P) = 9 
for all P. Therefore, we can conclude that the inequality 
ct' {P) < 8 is violated for all P. One can reach this con¬ 
clusion without ever considering the question of whether 
the operational predictions can be explained by some un¬ 
derlying noncontextual model. 

Another consequence of the triviality of the variables of 
the form 0 ® i^k 3 ® i'k 4 is that the inequality (Cll) 

can be violated regardless of how noisy the measurements 
are. Suppose, for instance, that quantum theory de¬ 
scribes our experiment, but that the nine four-outcome 
measurements are not the projective measurements de¬ 
scribed in Fig. 0 . but rather noisy versions thereof. For 
instance, one can imagine that each measurement is as¬ 


sociated with a positive operator-valued measure that is 
the image under a depolarizing map of the projector val¬ 
ued measure associated with the ideal measurement. The 
amount of depolarization can be taken arbitrarily large 
and, as long as it is the same amount of depolarization 
for each of the measurements, the nine noisy measure¬ 
ments that result will still satisfy precisely the same op¬ 
erational equivalences as the original nine, namely, those 
depicted in Fig. [^a). For such noisy measurements, we 
can still identify variables Vn associated to the eighteen 
equivalence classes of measurement events, and we still 
find that regardless of which of the four outcomes of the 
measurement occurs, the variable v^i 0 ® ^^3 ® '^ k 4 

has the value 1 , so that regardless of what distribution 
over the four outcomes is assigned by P, the expectation 
value (uki 0 Uk 2 0 Vks. 0 Vk^)p will be 1 and t herefo re 
a'{P) = 9, which is a violation of the inequality (Cll). 

According to the generalized notion of noncontextu¬ 
ality proposed in Ref. [5], if one adds enough noise to 
the preparations and measurements in an experiment, it 
always becomes possible to represent the experimental 
statistics by a noncontextual model. One way to prove 
this is to note that: (i) if all of the preparations and the 
measurements in an experiment admit of positive Wigner 
representations, then, as demonstrated in Ref. m, the 
Wigner representation defines a noncontextual model, 
and (ii) if one adds enough noise to the preparations and 
measurements, it is possible to ensure that they admit of 
positive Wigner representations. 

This analysis of the effect of noise accords with intu¬ 
ition: noncontextuality is meant to represent a notion of 
classicality, so that a failure of noncontextuality is only 
expected to occur in a quantum experiment if one’s ex¬ 
perimental operations have a high degree of coherence. It 
follows that there should always exist a threshold of noise 
above which an experiment cannot be used to demon¬ 
strate the failure of noncontextuality. One can turn this 
observation into a minimal criterion that should be sat¬ 
isfied by any noncontextuality inequality: there should 
exist a threshold of experimental noise above which a 
noncontextuality inequality cannot be violated. 

As we have just noted, the inequality proposed in 
Ref. [13] fails this minimal criterion. By contrast, the 
noncontextuality inequality proposed in this article iden¬ 
tifies such a threshold for the 18 ray construction: the 
noise must be kept low enough that the average of the 
measurement predictabilities is above 5/6. 


2. Alternative interpretation 

The inequality proposed in Ref. [I3] can be given a 
different interpretation to the one provided in the previ¬ 
ous subsection. This interpretation is more charitable in 
some ways, but it still does not vindicate the proposed 
inequality as delimiting the boundary of noncontextual 
models. 

The idea is to imagine that for each of the nine mea- 
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surements, there are in fact five rather than four out¬ 
comes that are mutually exclusive and jointly exhaus¬ 
tive. Thus, in this interpretation, it is assumed that the 
hypergraph describing compatibility relations and op¬ 
erational equivalences is not the one of Fig. [^a), but 
rather a modification wherein there are nine additional 
nodes—one additional node appended to each of the nine 
measurements—as depicted in Fig. [^a). 



FIG. 7. (a) The hypergraph wherein each measurement is 
assigned an additional fifth outcome, (b) A normalized non- 
contextual deterministic assignment to the hypergraph of (a) 
that recovers the subnormalized noncontextual deterministic 
assignment of Fig.|^a) on the appropriate subgraph; (c) The 
hypergraph wherein the fifth outcomes are all operationally 
equivalent; (d) the unique normalized noncontextual and de¬ 
terministic assignment to the hypergraph of (c). 


If {ki, K 2 , Ks, K 4 } are the original four outcomes of a 
given measurement, then the variable Vk,i 0 Vk .2 0 0 

is no longer a constant function of the measurement 
outcome because its value varies depending on whether 
or not the fifth outcome occurs. If K 5 denotes the fifth 
outcome of the measurement, then the trivial variable is 
Vki 0 ® ) taking the value 1 regardless 

of the outcome. 

In this case, the assignments of the type depicted in 
Fig. [^a)— the noncontextual deterministic assignments 
that are subnormalized —can be embedded into non¬ 
contextual deterministic normalized assignments on the 
larger hypergraph, as depicted in Fig. [^b). (The possi¬ 
bility of such an embedding for the subnormalized non¬ 
contextual deterministic assignments considered in Ca- 
bello, Severini and Winter m was noted in Acin, Fritz, 
Leverrier, Sainz m-) 

Of course, such a move does not provide any way of un¬ 
derstanding the deterministic noncontextual assignments 
of the type depicted in Fig.|^b), because the latter vio¬ 
late normalization by having the probabilities of the dif¬ 
ferent outcomes of the measurement summing to greater 
than 1—they are supernormalized. 

So, while the supernormalized noncontextual deter¬ 


ministic assignments can be ruled out by logic alone, the 
subnormalized noncontextual deterministic assignments 
may be entertained without logical inconsistency if they 
are considered as reductions to a subgraph of a normal¬ 
ized noncontextual deterministic assignment on a larger 
hypergraph. 

Because the justification given in Ref. [13] for the in¬ 
equality derived there asks one to consider all of the 
noncontextual deterministic assignments, including the 
supernormalized ones, the interpretation of this inequal¬ 
ity as a constraint on subnormalized assignments is in 
tension with the manner in which the inequality is jus¬ 
tified. This interpretation is a better fit with Cabello’s 
later work, such as Ref. m, wherein the restriction to 
subnormalized assignments is explicit. In any case, if the 
inequality holds for all noncontextual deterministic as¬ 
signments, regardless of normalization, then it holds for 
the special case of the subnormalized assignments, so the 
inequality can still be derived within this interpretation. 

The problem with this interpretation becomes man¬ 
ifest when we require that the original hypergraph 
of Fig. I^a)— and thus the corresponding subgraph of 
Fig. [^aj from which it is derived in this interpretation— 
is realized in terms of Hilbert-space bases in the manner 
depicted in Fig. [^a). 

We consider two possible ways of fulfiling this require¬ 
ment, and explain why it is not possible to vindicate the 
inequality of Eq. (C7) in either case. 

In one approach, we imagine that the quantum system 
is in fact described by a 5-dimensional Hilbert space. In 
this case, rank-1 projective measurements have five out¬ 
comes and are therefore described within the hypergraph 
representation by an edge with five nodes, just as we have 
for the measurements in Fig. [^a). Now consider an as¬ 
sociation of Hilbert space rays with the nodes of this hy¬ 
pergraph such that one recovers the association of rays 
to nodes described by Fig.[^on the subgraph of Fig.J^a) 
that corresponds to the original hypergraph of Fig. [^a). 
This is possible if, for every measurement, the fifth out¬ 
come is associated with a ray that is orthogonal to the 
4d subspace in which all of the other rays live. But then, 
under a tomographically complete set of preparations of 
the 5d Hilbert space, one finds that the fifth outcomes 
are all operationally equivalent, so that the appropriate 
hypergraph is not that of Fig. [^a) but rather the one 
depicted in Fig. [^c). 

Now, consider this hypergraph. It only admits of a sin¬ 
gle normalized noncontextual deterministic assignment, 
the one that assigns Os to every outcome in the origi¬ 
nal set and 1 to all of the fifth outcomes, as depicted in 
Fig. [3d). Therefore, if one were to experimentally verify 
the applicability of the hypergraph of Fig. [^c), by veri¬ 
fying the operational equivalences depicted therein, then 
any KS-noncontextual model consistent with this hyper¬ 
graph v^ld not only satisfy the inequality a'(A) < 8 
(Eq. (CIO)), it would predict that all of the measure¬ 
ment events appearing in the inequality receive value 
0 , so that the inequality could be strengthened to the 
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equality a'(A) = 0, which in turn would imply, through 
Eq. (C9), that for all preparation procedures P, the op¬ 
erational inequality a'{P) < 8 could be strengthened to 
the operational equality 


a'{P) = 0. 


(C12) 


But this is trivial to violate experimentally: simply find a 
preparation that does not always yield the fifth outcome 
for every measurement. 

We take the triviality of this constraint to speak 
against the idea that it captures the assumption of non- 
contextuality. Therefore, the conclusion to draw from 
this discussion is not that one should replace the in¬ 
equality a'{P) < 8 with a'{P) = 0. Rather, as we’ve 
argued at length in the main text, because the KS- 
noncontextual models make the unjustified assumption 
of outcome-determinism, the notion of noncontextuality 
should not be formalized as KS-noncontextuality, but 
rather as measurement and preparation noncontextual¬ 
ity. 

We now turn to the second approach. Here, one sticks 
to the notion that the quantum system being probed is 
4-dimensional and instead one suggests that each of the 
nine measurements is nonprojective, that is, each is rep¬ 
resented by a positive operator valued measure rather 
than a projector valued measure. In this way, one can 
ensure that the measurements indeed have five outcomes. 
One might even think of the fifth outcome as representing 


a ‘no detection’ event (the idea of justifying subnormal¬ 
ized assignments by imagining an additional ‘no detec¬ 
tion’ outcome has also been discussed in Ref. m)- 

To see that there is something fishy about this ap¬ 
proach, it suffices to note that if it were correct, then 
it would have the bizarre consequence that in the case 
where the measurements achieve the ideal of projective¬ 
ness, satisfaction of the inequality a'{P) < 8 is ruled 
out by logic alone, whereas if the measurements depart 
from this ideal, however little, suddenly the inequality 
specifies whether or not the experiment can be modelled 
noncontextually. 

In any case, the real problem with this approach is eas¬ 
ily identified. For a nonprojective measurement, one is 
assigning probabilities to effects (positive operators less 
than identity) rather than projectors. In this case, one 
must allow noncontextual assignments to be probabilis¬ 
tic. This has been proven elsewhere [18] and we will not 
repeat the arguments here. Such probabilistic noncontex¬ 
tual assignments are not restricted to be in the convex 
hull of the deterministic noncontextual assignments, and 
therefore can be more general than mixtures of the lat¬ 
ter. Because the derivation of the inequality a'{P) < 8 
made crucial use of the assumption that the preparation 
P was a mixture of deterministic noncontextual assign¬ 
ments, the fact that the assumption of determinism is 
unwarranted implies that one can no longer derive the 
inequality as a constraint on noncontextual models. 



